SEMANTIC INFORMATION PREPROCESSING FOR NATURALLANGUAGE INTERFACES TO DATABASESbyMilan

نویسنده

  • Milan Mosny
چکیده

A natural language interface to a database (NLID) needs both syntactic information about the structure of language and semantic information about what words and phrases mean with respect to the database. A semantic part of the NLID can implicitly or explicitly provide constraints on the input language. A parser can use these constraints to resolve ambiguities and to decrease overall response time. Our approach is to extract these constraints from the semantic description of the database domain and incorporate them semi-automatically or automatically into information directly accessible to the parser. The advantage of this approach is a greater degree of system modularity, which usually reduces complexity, reduces the number of possible errors in the system and makes it possible to develop di erent parts of the system concurrently by di erent persons and thus reducing development time. Also domain independent syntactic information can be reused from domain to domain, then customized according to the semantic information from a speci c domain. To implement the idea, Abductive Equivalential Translation (AET) was chosen to describe the database related semantics. AET provides a formalism which describes how a \literal" logical form of an input sentence consisting of lexical predicates can be translated to a logical form consisting of predicates meaningful to the database engine. The information used in the translation process is a Linguistic Domain Theory (LDT) based on logic. We shall constrain the expressive power of LDT to suit tractability and e ciency requirements and introduce Restricted Linguistic Domain Theory (RLDT). The main step for incorporation of semantic constraints into the syntax formalism is then extensive preprocessing of the semantic information described by an RLDT into Normalized Linguistic Domain Theory (NLDT). The system uses NLDT to produce selectional restrictions that follow from the semantic description of a domain by RLDT. Selectional restrictions in general iii state which words can be immediately combined with which other words. Once NLDT is constructed, it can be used as a main source of information for semantic processing of a sentence. Thanks to the soundness and completeness of the normalization process, the designer of the interface has possibility to express the semantic knowledge in more declarative terms.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Semantic Parsers: An Important but Under-Studied Problem

Computational systems that learn to transform naturallanguage sentences into semantic representations have important practical applications in building naturallanguage interfaces. They can also provide insight into important issues in human language acquisition. However, within AI, computational linguistics, and machine learning, there has been relatively little research on developing systems t...

متن کامل

Semantic Completely Preprocessing for Deep Web Queries Translation

Local schemas embedded in query interfaces of Deep Web are partial of database schemas. However, local schemas are extracted incompletely form query interfaces coded by semi-structure programming language. Some key attributes missed lead to form incompletely semantic queries. To tackle the problem, a semantic completely preprocessing for queries is presented based on Ontology instrument and ins...

متن کامل

Semantic Information Preprocessing for Natural Language Interfaces to Databases

An approach is described for supplying se-lectional restrictions to parsers in natural language interfaces (NLIs) to databases by extracting the selectional restrictions from semantic descriptions of those NLIs. Automating the process of finding selectional restrictions reduces NLI development time and may avoid errors introduced by hand-coding selectional restrictions.

متن کامل

Learning for Semantic Parsing

Semantic parsing is the task of mapping a natural language sentence into a complete, formal meaning representation. Over the past decade, we have developed a number of machine learning methods for inducing semantic parsers by training on a corpus of sentences paired with their meaning representations in a specified formal language. We have demonstrated these methods on the automated constructio...

متن کامل

Probabilistic Soft Logic for Semantic Textual Similarity

Probabilistic Soft Logic (PSL) is a recently developed framework for probabilistic logic. We use PSL to combine logical and distributional representations of natural-language meaning, where distributional information is represented in the form of weighted inference rules. We apply this framework to the task of Semantic Textual Similarity (STS) (i.e. judging the semantic similarity of naturallan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1996